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(3) Hybrid polypeptide containing an avidin binding polypeptide. 



(57) A hybrid polypeptide is disclosed comprising 
an avidin-binding polypeptide containing a 
biotin binding domain fused to a polypeptide of 
interest wherein the avidin-binding polypeptide 
is 'upstream" of the polypeptide of interest 
The hybrid polypeptide is produced by recom- 
binant DNA techniques. The hybrid polypeptide 
may also contain a deavage site for cleaving the 
polypeptide of interest from the avidin-binding 
polypeptide by using an appropriate proteolytic 
or chemical reagent The hybrid polypeptide is 
expressed in appropriate host ceils transformed 
with the DNA expression vector encoding the 
hybrid polypeptide, and may be recovered from 
crude cell extracts in high yield and high purity 
using avidin affinity chromatography. Following 
avidin affinity purification, the polypeptide for 
attachment and polypeptide of interest may be 
cleaved to yield polypeptide of interest in a 
highly pure and highly active state. 
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The present invention relates to a hybrid polypeptide. 

In particular, the present invention relates to a recombinant hybrid polypeptide comorisino a no. 

of .teres, fused to an avidin-bindin 9 polypep.de. The av^-binding pol^ptid'e c^S^nt 

The present invention also relates to a nucleic add sequence that encode* for fho huK^ , 
process for producing the same and a process for recovering same * Polypeptde. a 

In particular, the nucleic acid sequence is a DNA expression vector 

«onr P S;lL s ~ 

cadavers, serum, and urine have been .^^^5X2? 

However, advances in recombinant DNA technology have made iiTSteS svn£~,^ f * m ""^ 
peptides possible and in commercial quantities, .n this regard. ^S^^JS^f^^ P °' y - 
thesisofcommercia.ly useful polypeptides can be introduced ^iwCSSJlSSSfS^ 1 ^ 
For example, recombinant DNA technology has enabled human growth hlZZ7o6u^^ 
bacteria and. today, fermentation replaces the traditional source. by recomb,nant 

Todate. biological synthesis is the only practical approach to the commercial seal. ■ . 

of greater than 20 amino acid residues. Once syntheseed. the <^tTZZtTZ^^J?T, 
from a complex mixture of cellular components. The degree of purification depends ^ZSZEStt 
cat,on of the polypeptide. The cost of purification can account for up to 70V. TZ cos' lllrL^Z K 
stantial f losses ,of active ingredient usua.ly occur during mu.tistep Z^ ,!^ """^^ 
Polypeptide purifications are usually achieved through one or more processes which are based uoon nhl «. - 
,ca. propert.es of the po.ypeptide of interest. For example, proteins may be separated on tZ bast^sZ!^ 

Affinity resin chromatography ear greatly reduce the numher of purification st.os r«w,.H *,„..• ... 
dee,,*, leyet of p u% . Affinity potion s based opon a specific SZ^! 

on the attachment domain are washed away from the resin 9 6 9 ™ P 

Therefore, in a single step, the polypeptide of interest may be recovered in high purity 
In order to use affin.ty chromatography to advantage for polypeptide purification recombinan. hma ta k_ 

nology can be used to construct chimeric gene fusions for remnant 1*55122 SStSZSZ 

fn JZ^m P T° tere ^ th ° Se WhiCh maximize «P fe ssion of the desired gene in the host cell and factors 
ers appropriate for express*, of doned genes includefce SSS^JR SSlJS?* Prom ° t ' 

inserted into a " ° 0mbmatron5 ^ ^ DNA SequenCeS are «••"**' '"*> a ^"C gene.ThS 

S imefest and ^ oZShV"^ T "* 3 ^ * appropriate host ■» *S 

S yP ^S? 6 f0 ! r attachment to » e affini * «sin are produced as a single polypeplde chain 

entS ^ « USC f ° r COmmercia, - Sca,e Polypeptide purification. For example, chimeric genes 

8^£ V T??J*T nin9 3 PO,y8r9inine ^i" 31 13,1 < Sassenfeld an ° Brewer. Bioichno.iy 2 7^ 

e 12X22 meta UteZ * (Smilh 61 * X Bi °' Ch6m 263:7211 < 1988 » «" fac3ilate ^paratton'byTon 
teracZ d!rT? I chromatography. Such systems are not broadly applicable because affinity Z 

i™T*l a ToZ^£ ° f ^ fUSi ° n P ° ,ypep0de »» chelate metafcland" 

^SZl^J ^ Cha " 9e th6Se Phys,cal properties to affi <% binding, 

are fCsS I to '"^^^ chromatography, wherein polypeptides of interest 
JSlr^nTSSTJ Pf0 ,r S 38 E 50,1 ^^^e (Ruther and Mul.er-Hi!.. EMBO J 2:1791- 
1794 (1983)) or small hydroph,hc pept.des (Hopp etal.. US 4.703.004 (1988)) to achieve purification. Polypep- 
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tides fused to staphylococcal protein A can be purified using IgG-Sepharose (Nilsson et al. EMBO J. 4:1075- 
1080 (1985), Lowenadier et al. EMBO J. 5:2393-2398 (1986)). Polypeptides fused to Protein G can be isolated 
using albumin as the immobilized ligand (Nygren et al. J. Mol. Recognition, 1:69 (1988)). 

A critical disadvantage limiting the usefulness of these prior art methods is that extreme conditions, indud- 
5 ing the use of denaturants, are necessary to remove the fusion proteins from the affinity resin, which may de- 
stroy biological activity if native folding cannot be achieved. Low product recovery rates can also limit the use- 
fulness of such systems. 

Affinity based upon the binding of small molecules by a large protein is known as substrate-affinity chro- 
matography. A small molecule, a ligand, forms a complex with a specific ligate. Examples of ligand:ligate corn- 
to binations include avidin:biotin (Green in Advances in Protein Chemistry Vol 29 pp 85-133 Anson et al. f 
Eds.(1975)). streptavidin:biotin (PCT/US85/01901. Meade and Garvin (1985)), lipoic acid:avidin (Green ibid), 
chloramphenicol acetyl transferase:acetyl CoA (EPO 0 1 31 363, Bennet et al. (1 984)), beta-galactosidase: para- 
aminophenyl-beta-D-thio-galactoside (Offensberger et al. Proc. Natl Acad. Sci USA 82:7540-7544 (1985)), 
phosphate binding protein:hydroxyapatite (Anba et al. Gene 53:219 (1987)), maltose binding protein:starch 
is (EPO 286239. Guan et al. 1988), and glutathione S-transferase:gIutathione (Smith and Johnson Gene 67:21- 
30(1988)). 

In recent years, the unique properties of the prosthetic group biotin and its exceptionally high affinity (1015 
M-1) and specificity for the proteins avidin and streptavidin (Green ibid.) have been exploited to devise powerful 
and widely applicable tools for microbiology, biochemistry and medical science (Wilchek and Bayer Analyt Bio- 

20 chem 171:1-32 (1988), Bayer and Wilchek Methods in Biochem Anal 26:1-45 (1980)). 

Biotin is a prosthetic group found on only a few protein species (Ann N.Y. Acad. Sci 447:1-441, Dakshina- 
murti and Bhagavan, Eds. (1985)). Attachment in vivo is mediated by biotin holoenzyme synthetases which 
recognizes a highly conserved attachment domain and catalyzes the covalent attachment of biotin to that do- 
main (Wood et al, J Biol Chem 225:7397-7409 (1980); Shenoy and Wood, FASCB S 2:2396-2401 (1988)). 

25 Experiments using recombinant DNA technology have shown that biotin holoenzyme synthetases will bio- 

tinylate heterologous polypeptides containing this conserved attachment domain. For example, the 1.3S sub- 
unit of the enzyme transcarboxylase from Propionibacterium, which contains the conserved sequence, when 
cloned and expressed in E. coli is biotinytated by the E. coli synthetase (Murtif et al. Proc Nat Acad Sci USA 
82:5617-5621 (1985)). 

30 A polypeptide or part of a polypeptide containing the conserved biotin attachment domain, such as entire 

1.3S (SEQ ID NO:1) protein or the biotin-binding recognition sequence identified within the 1.3S protein from 
Propionibacterium, (SEQ ID NO:2) can be incorporated into a hybrid recombinant polypeptide. Such a hybrid 
polypeptide containing a biotin attachment domain fused to one or more polypeptides of interest could be used 
to achieve the separation of virtually any recombinant protein based upon the affinity of the ligand avidin for 

35 the ligate biotin. 

Avidin:biotin chromatography shares advantages generally applicable to substrate affinity chromatography 
systems for commercial-scale polypeptide purification. Substrate-affinity resins are generally inexpensive. Fu- 
sion proteins can be recovered using mild conditions by elution with free ligand. 

Post-translationa! addition of the biotin prosthetic group is independent of the final folded state of the protein 
, r 40 (Wood et al. J Biol Chem 255:7397-7409 (1980)), an advantage when the host cell performs no post- transla- 
tional modifications on the recombinant polypeptide. 

A ligand domain such as the domain directing biotin attachment would therefore be particularly advanta- 
:% } geousfor recovery of fusion proteins found in inclusion bodies or for recovery of insoluble proteins which require 
f denaturants or zwitterionic detergents for solubilization during extraction, prior to affinity chromatography. 

4S PCT WO 90/14431, which names Cronan as an inventor, discloses a hybrid DNA sequence encoding a 

fusion protein comprising a first DNA sequence which encodes an amino acid sequence that allows for post- 
translation modification of the fusion protein; and a second DNA sequence joined end to end with the first DNA 
sequence and in the same reading frame, the second DNA sequence encoding a selected protein or polypep- 
tide. In each of the examples the first DNA sequence is fused to the 3' end of the second DNA sequence (i.e. 
£ so the first DNA sequence is downstream of the second DNA sequence). 

Also disclosed disclosed in PCT WO 90 / 14431 is a vector comprising the hybrid DNA, a host transformed 
with the vector, a method of producing a fusion protein by culturing the transformed host under conditions per- 
mitting expression of the fusion protein, a fusion protein comprising a selected protein or polypeptide linked to 
an amino acid sequence that allows for post-translation modification of the fusion protein and a method of iso- 
55 lating the fusion protein comprising providing a binding partner that binds to the fusion protein only after it has 
been modified, contacting the modified fusion protein with the binding partner under conditions permitting bind- 
ing, separating the modified fusion protein bound to the binding partner from unbound materials in the mixture, 
and eluting the modified fusion protein. 



3 
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The work of Cronan is also disclosed in J Biol Chem 265 10327 irvm Moom ,r> 
binant DNA plasmid from E. coli (Murtif e, a,. Prcc Nat Acad S fSl* SV/S^^ ! ^ 
struct fus.cn genes containing segments of the 1.3S gene, which contamTe bfotfn IteS^T'* * ^ 

Cronan (ibid) demonstrated that 1.3S sequences can be used to siffical lvth!f„? > ° ma,a 
purify proteins from crude ce.l .ysates by avidin affinity chromatography " Pr ° temS " a " d l ° 

As in each of the examples of PCT WO 90/1 4431 Cronan's r,w,ri\ rh; m „„v. „« 
ing the 3' end of the genes of interest to the 5' end of fhT 1 3S H ffeT ?5 ? ^ r^** b/ 
gene of interest), yielding hybrid recombinant polype hdes h^ downstream of the 

terminus of the 1 .3S polypeptide /peprraes naving the polypepbdes of interest fused to the N- 

polypeptide "eliminates biotinylation.- nyorophob.c.ty of the carboxyl terminal region of the 1.3S 

Murtif Samols did observe biotinylation of 1.3S polypeptides each lpnr.»h«.n»w k k 
1.3S carboxyl terminus. However such add!*™* «r k! • ! Ien 9thened by two ammo acids at the 

terminal and a cleavable CTSltSRl^ ^ haSa " an,i9enfc 
ecule. The linking portion of the idenSonTep id ^ SSt??iS?l? nd ^K,, ^ 
jacent the functional protein by use of a sequence P m ' n ° 3C ' d res,due ad " 

protein is cleaved from the isolated hybrid SCSJSSiS^^ * "T*"' ■° BBt Whe " 
state is released. As with Murtif and SamoE 2 TOT ™S gOM^l ^ " " V **** 3Ctive 

elled. chemically modified proteins £ v/voand ateo taSJtefnJ? J ^ We fused gene to produce lab- 
of the gene encoding the protein. The |£ £n ^^a ^NA^Z!„r "I SeQUenCe 
interested toasecond DNA fragment m^^^S^SSSS^i *T 7** ° f 
or a biotin derivative. The fused gene of U^jLjSa Z TJ, ? P b,n<,,n 9 site s^btotin 

megeneisinsertedintoasuitableexpress^onv^aS^ 

are separated by the addition of tiSS^Z^S^^ *'^^^^^ fa ^ 
the then-Known commercia. prepa^STp^ aat^bZ^n^fr 
tavidin which has all four valences free for biotin binding contaminant-free source of strep- 

However. as with Murtif and Samols (ibid). PCT WO 90/14411 rm« a „ m.™ ...... 

fused to the C terminus erf the a*2«, £„^ Wherei " the P"**** of interest is 

tional protein) is VoJsteZ^^^?*^ ^ of '»««* Ce the desired tunc 
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main, characterised in that the polypeptide of interest is fused to the C terminus of the avidin-binding polypep- 
tide. 

Preferably,- biotin is attached to the avidin-binding polypeptide. 

Preferably, the polypeptide includes a cleavage site for cleaving the polypeptide of interest from the avidin- 
5 binding polypeptide. This cleavage site may be between the polypeptide of interest and the avidin-binding poly- 
peptide or it may be integral with either or both of the polypeptide of interest and the avidin-binding polypeptide. 

Preferably, the cleavage site is as partic acid- proline, asparagine-glycine, methionine, cysteine, lysine-pro- 
line, arginine-proline, lysine-arginine or isoleucine-glutamic acid-glycine-arginine. 

Preferably, the avidin binding polypeptide is. or is part of, a 1.3S polypeptide. Preferably, the 1.3S poly- 
10 peptide is from Propionibacterium. 

Preferably, the biotin attachment domain of the avidin-binding polypeptide comprises at least one of the 
sequence: 

f5 Pro Ala Pro Leu Ala Gly Thr ValSer Lys De Leu Val Lys Glu 

Gly Asp Thr Val Lys Ala Gly Gin Thr Val Leu Val Leu Glu 
Ala Met Lys Met Glu Thr Glu De Asn Ala Pro Thr Asp Gly. 

20 

Preferably, the avidin-binding polypeptide comprises a plurality of non-contiguous and / or contiguous avi- 
din-binding polypeptides, which may be the same or different 

Preferably, the polypeptide of interest comprises a plurality of non-contiguous and /or contiguous polypep- 
tides of interest, which may be the same or different 
25 Preferably, the polypeptide of interest is an enzyme, is an antigen useful for vaccine production, or a di- 

agnostic reagent 

Preferably, the polypeptide of interest has antitumor activity or has an amino acid sequence for recognition 
of antigens. 

According to a second aspect of the present invention there is provided a nucleic acid sequence coding 
30 for a hybrid polypeptide comprising a polypeptide of interest and an avidin-binding polypeptide containing a 
biotin attachment domain, characterised in that the nucleic acid sequence coding for the polypeptide of interest 
is downstream of the nucleic acid sequence coding for the avidin-binding polypeptide. 

Preferably, the nucleic acid sequence is a DNA sequence. 

Preferably, the DNA sequence contains in a 5* to 3' direction on the coding strand a gene comprising a 5' 
35 promoter region, a DNA sequence coding for the avidin-binding polypeptide and the DNA sequence coding for 
a polypeptide of interest 

Preferably, the DNA sequence is, or is part of, an expression vector or a plasmid. 
According to a third aspect of the present invention there is provided a process for the production of a hybrid 
polypeptide according to the first aspect of the present invention comprising constructing a plasmid containing 
40 a nucleic acid sequence according to the second aspect of the present invention, transforming the plasmid into 
a procaryotic or eucaryotic host ceil expression system, expressing the system, contacting the hybrid polypep- 
tide resulting from the expression system with avidin, and harvesting the resulting avidin-bound hybrid poly- 
* v< peptide. 

Preferably, the expression system is either E. Coii or insect cells. 
45 According to a fourth aspect of the present invention there is provided a process for the isolation of a hybrid 

polypeptide according to the first aspect of the present invention comprising contacting the hybrid polypeptide 
with avidin. 

Preferably, the avidin is monomeric avidin, tetrameric avidin or streptavidin. 
Preferably, the polypeptide of interest is cleaved from the isolated hybrid polypeptide. 
1»" Preferably, the hybrid polypeptide is isolated using avidin covalently bound to a chemically inert solid, wa- 
ter and solvent insoluble substrate through a chemically stable non-hydrolyzable linking group. 

Preferably the hybrid polypeptide is isolated using avidin monomer affinity chromatography. Preferably, the 
^ hybrid polypeptide is isolated using avidin monomer affinity chromatography disclosed in EP-A-0414785 
(90310154.1). 

55 EP-A-0414785 discloses a monomeric avidin polypeptide ligand and a novel and particularly efficacious 

process for isolating synthetic or natural molecules and / or biotinylated derivatives thereof, by adsorbtion of 
the molecules of interest onto a novel affinity media which contains avidin fixed to a solid inert support 

According to a fifth aspect of the present invention there is provided a first kit comprising a hybrid polypep- 

5 
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tide according to the first aspect of the present invention and avidin 

Accord.ng to a sixth aspect of the present invention there is provided a second ki» 
acd sequence according to the second aspect of the present invention and VST ^ * 

■ * * ccord,n 9 ,0 3 seventh aspect of the present invention there is provided a third kit • 
• acd sequence which codes for an avidin-binding polypeptide containing f=T k ""^P^mg a nucleic 
h fusable to a nudeic add sequence coding ^P^S^^i^!^^ "* «** 
sequence according to the second aspect of the presentment Z ™ ' ""^ add 

Preferably, the third kit comprises means to fuse the nudeic acid sLZL ~r , .w 
polypeptide to the nudeic acid sequence coding for the oolvoeo L, «, ? ? 9 f ° f the avidi "- bi "<«n9 

» sequence according to the second aspect oJihe JretnUnve„«on " t0 ^ the " UC,eiC acid 

P0.ypTpS b,y,a ^ 0ne ° fthekiteMmPrfeeSmeanSto ^ 
«des'o^ 

inh^^ 

technique disdosed in EP-A-0414785 9 P St6P,SUChaS theav,d,nmonomer 3^ chromatography 

^1^^ "! ° f the P ° fypeptide - t to the C- 
peptide). ' ° f the V*™**™* cont3 '"-"9 the biotin domain to avidin (e.g. a 1.3S poly- 

tributes to the efficiency of JZ^J^£?Z £ commentary above). Promoter strength con- 
tiation of translation is important to^^T^JS^ pr ° Cesses inv °' vad in the ini- 
tides of interest are inhoduced at the ^ t^^^S^ST'' 7 ^ ^ When 
domain to avidin (e.g. a 1.3S gene) no dto^ZZZZT . 9 . lhe polv P e P tide containing the biotin 

cells can be achieved. ^ sequences. Thus, maximal expression levels In host 

Co^rr^ 

to achieve maxima, expresston levels Tn hS ceSs ' expem,inatfon a " d ^"9 is required 

avidMl^ 

biotin attachment region m*y bTprese^S fthua n h IZT?' "° th3tthe ^conformation of the 
disdosed in Murtif and SamL fib^T ? methods < such as »— 

which in fact teach away from the ^1 werton) "S^ 782137 and ^-4839293, 

4839293) and fused a polypepuaW^ US-A4782137 and US-A- 

to avidin such as a 1 .aV^y^ * "» *e biotin domain 

was indeed bfethylate?TS2S^V^;^? T^ 6 * 138 «** hybrid 

the addition of a polySpfide SZrr^^*^ 5 ^ Sa ^ 
conf„r^ 

peptide, at the C-tenninus oTtoe 1 3S SESi ~" J' mn 9 *> avidin. such as a 1.3S poly. 

Polypepboewiminmish^^^^ 
teach mat fusion at me Un7i^^^^ 

formation of the 1.3S polypep™^^^^^^^^ * T" h ^ hobi ^ and "us the native con- 

polypeptides to avidin ^ b,ot,nylat.on. and consequently inhibiting the binding of hybrid 

the biotin molecule avaflable foTbinS £ h t ! T ° a " 0ther P 0 ^'^ « Positioned so as to make 
0414785. " b ' nd,n9 11,6 hybnd P°'ypepUde to the avidin monomer affinity resin of EP-A- 
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Furthermore, the hybrid polypeptide is selectively retained by the avidin resin and can be recovered in high 
yield and high purity. 

In the present invention, a binding domain, or recognition sequence, directs the attachment of biotin to the 
hybrid polypeptide. The biotinylated hybrid polypeptide can then be specifically selected by affinity ligand com- 
5 positions such as the avidin monomer resin of EP-A-0414785. 

The present invention therefore yields a single purification step that can separate the protein of interest 
from complex mixtures, such as crude bacterial lysates, with high levels of recovery in a single chromatographic 
step, thus alleviating the recovery problems inherent to multistep purification processes. Such a combination 
of hybrid polypeptide and avidin monomer affinity resin would clearly confer significant advantages to the pur- 
10 ification of commercially useful polypeptides over existing processes. 

An avidin-binding polypeptide for attachment is generally a polypeptide that enables the attachment of a 
hybrid fusion polypeptide to avidin. Among such polypeptides for attachment that may be used are those con- 
taining a recognition sequence for attachment of the prosthetic group biotin, such as the 1.3S polypeptide sub- 
unit of transcarboxylase from Prop ioni bacterium. 
15 In addition to using the entire sequence of the 1.3S polypeptide as the polypeptide for attachment, other 

smaller portions of the 1 .3S polypeptide may be used which direct the attachment of biotin, particularly portions 
comprising ail or part of amino acid residues 58 through 100 (SEQ ID NO: 2). 

One or more deletions, substitutions, insertions or mutations may be made by methods well known in the 
art which result in a biotinylated 1.3S polypeptide or biotinylated fragment The nucleotide sequence coding 
20 for the 1.3S polypeptide or fragments may be synthesized using a commercially available DNA synthesizer in 
a manner well known in the art 

Additionally, other polypeptides or portions thereof that are enzymaticaly biotinylated may also be em- 
ployed. 

Unless indicated otherwise, the term "avidin" includes streptavidin. 
25 The polypeptide of interest may include two or more polypeptides of interest 
The two or more polypeptides of interest may be fused sequentially. 
Optionally, contiguous polypeptides of interest may be fused sequentially. 

Additionally, more than one polypeptide of interest may be present in a noncontiguous arrangement, for 
example, one polypeptide of interest may be fused to the N-terminus of the polypeptide for attachment and one 
30 polypeptide fused at the C-terminus of the same polypeptide for attachment 

Two polypeptides of interest may be fused to the C-termini of two polypeptides for attachment arranged 
as: polypeptide for attachment 1 - polypeptide of interest 1 - polypeptide for attachment 2 - polypeptide of in- 
terest 2. 

In any of the arrangements disclosed here, the polypeptides of interest may be the same, or different The 
35 polypeptides for attachment may be the same, or different 

An advantage of fusing a plurality of polypeptides of interest to at least one polypeptide for attachment is 
the ability to increase the yield of a single polypeptide of interest present by including two copies of that poly- 
peptide within a single hybrid polypeptide, and/or to increase the number of polypeptide species that can be 
purified simultaneously by a single avidin affinity chromatography step, if the polypeptides of interest are dif- 
40 ferent 

4 ^ A cleavage amino acid or sequence of amino acids may be present between the polypeptide containing 

the biotin domain to avidin and the polypeptide of interest 
H tikewise, a cleavage amino acid or sequence of amino acids may be present between the polypeptides of 

; interest if two or more.of such polypeptides are present Such linking, or deavage, amino acid(s) permits the 
*;45 separation of polypeptides at a specific site or sites on the hybrid polypeptide when it is treated with the ap- 
propriate chemical reagent or enzyme. If desired, the cleavage site is positioned adjacent the polypeptide of 
interest so that the polypeptide of interest may be cleaved from the polypeptide for attachment 

The hybrid polypeptide itself may contain a linking amino acid or amino acids for cleaving the polypeptide 
or polypeptides of Interest from the polypeptide or polypeptides for attachment The linking amino acid or amino 
J 50 acids are incorporated between the polypeptide or polypeptides for attachment and the polypeptide or poly- 
peptides of interest iasuch a way that one or more cleavage reactions separate each polypeptide species to 
the degree necessary for intended applications. It may not in every instance be necessary to cleave all, some, 
or any of the species within a particular hybrid polypeptide. 

Amino acids that may be used to linked{ the polypeptide of interest to the polypeptide for attachment include 
55 aspartic acid-proline, asparagine-glycine, methionine, cysteine, lysine-proline, arginine-proline, isoleucine-glu- 
tamic acid-glycine-arginine, and the like. 

The at least one polypeptide for attachment may be cleaved from the at least one polypeptide of interest 
by exposure to the appropriate chemical reagent or cleaving enzyme. 

7 
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sypin from cotton seed, and the like Polypephdes, such as ncin from castor bean or gos- 

abases, are also contemplated by mis invention """""P" ■» «•*•» »» pceaenc. of certain 

cine Procaine. polvrteMi^ 

vector comprising a nucleic acid seouence aeror*,™ t« h-^ oreucaryooc cells transformed by a cloning 
purified away from the conJSSSS S£ KJSS? 'T ' ^ hybrW P 0 '*^ is 
form of avidin is avidin monomer * " afRn,ty chromat °9raphy- A particularly prefered 

column. The column is then washed with adwuate^unS 2 I 1 ^ ! ° ^ avidin monomer 
material, The hybrid polypeptide Is tTen SSZZZ 51 ^ to *" 1,16 00tann ° f Unbound 

deav^r^y^ 

cleaved mixture over the avWi SZZTf"?* " 6n2yme - Passa 9 e of the 

^andfrep^^ 

c.udetve*^ 

Polypeptide of ^o^^ to «"* *-* and high purity of the desired 

? ^l^n^T^lTL be produced by recombinart DNA "*»* * «"**0 

at least one |H-i^lSSiSS?J r^LZ- ^ 'f ^ DNA Sequences «*■ fo ' 

quence coding £ a polySp^fTn^t ' ^ ** ^ l ° ^ and at ,east one DNA - 

tachment ° tad> f ° f cleav,n 9 a Polypeptide of interest from a polypeptide for at- 

Genes coding for the various types of polypeptides of interest, for example those identified above, may be 
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obtained from a variety of procaryotic or eucaryotic sources, such as plant or animal cells or bacterial cells. 
Genes can be isolated from chromosomal material of eucaryotic or procaryotic cells, or from plasmids or viruses 
of procaryotic or eucaryotic cells by employing standard, well-known techniques. 

Additionally, automated DNA synthesis may be used to obtain DNA coding for naturally-occurring or syn- 
5 thetic polypeptides. To enable chimeric gene expression in host cells, a variety of naturally-occurring and syn- 
thesized DNA expression vectors having genes coding for many different polypeptide molecules are now com- 
mercially available from a variety of sources. 

The desired DNA can also be produced from mRNA by using the enzyme reverse transcriptase. This en- 
zyme permits the synthesis of DNA from an RNA template. 
10 In accordance with the present invention, once genes coding for one or more desired polypeptides of in- 

terest are isolated, synthesized or otherwise obtained, said gene or genes are joined to at least one gene coding 
for a polypeptide containing a recognition sequence for attachment of the prosthetic group biotin, thus enabling 
the attachment of the hybrid polypeptide to avidin. 

A gene directing the synthesis of a polypeptide for attachment is generally one coding for a polypeptide 
15 that enables the binding of a hybrid fusion polypeptide to avidin. Among such genes coding for polypeptides 
for attachment that may be used are those coding for amino acid sequences that direct the attachment of the 
prosthetic group biotin , for example the gene for the 1.3S polypeptide subunit of transcarboxylase from Pro- 
pionibacterium. 

A gene coding for a polypeptide for attachment may be one that directs the attachment of the prosthetic 
20 group biotin. A particularly biotinylated preferred biotinyiated polypeptide for attachment is the 1.3S subunit of 
transcarboxylase from Propionibacterium shermanii (SEQ ID NO:1). Although the gene coding for the entire 
1.3S polypeptide from Propionibacterium shermanii is preferred(SEQ ID NO: 4), optionally any gene or gene 
fragment coding for a polypeptide that directs the attachment of biotin may be suitable. The preparation of gene 
fragments is well known to those skilled in the art. 
25 The gene or genes coding for the at least one polypeptide of interest and the gene or genes coding for the 

polypeptide or polypeptides for attachment are preferably treated with the appropriate restriction enzymes, or 
otherwise treated to have cohesive termini to facilitate ligation with other elements of the chimeric gene or the 
DNA expression vector. 

The resulting DNA expression vector carrying the chimeric hybrid polypeptide gene is used to transform 
30 the appropriate procaryotic or eucaryotic host cell. The selection of a DNA expression vector appropriate for 
the desired host cell is well known to those skilled in the art 

Following the transformation procedure, the transformed host cells are isolated and analyzed for expres- 
sion of the hybrid polypeptide. Those transformants identified as containing the hybrid polypeptide are further 
analyzed by restriction enzyme digestion, DNA sequencing and other methods for confirming the correctness 
35 of the desired gene, by methods well known to those skilled in the art 

The transformants identified as host cells carrying the gene for the desired hybrid polypeptide are then mul- 
tiplied in culture to cause replication of the vector and high-level expression of the hybrid polypeptide that con- 
tains the polypeptide of interest 

The cloning vector may be used to transform additionally other strains of compatible hosts for large-scale 
40 production of the hybrid polypeptide. 

Various methods used for obtaining genes or gene fragments, preparing DNA expression vectors, trans- 
forming host cells, expressing hybrid polypeptides in host cells, and identifying those polypeptides are set forth 
* by J. Sambrook, EF. Fritsch, and T. Maniatis, Molecular Cloning, 2nd Edition. Cold Spring Harbor Press, 1 989, 
* and also by F.M.Ausubel, R. Brent, Kingston, D.M. Moore, J.G. Seidman, JA. Smith, K. Stiuhl., Eds. Cur- 
45 rent Protocols in Molecular Biology, Volume 1. John Wiley and Sons, New York 1989. 

To prepare DNA expression vectors, various cloning vectors may be used. A plasmid is preferred. However, 
a cosmid or bacteriophage may be used. If insect, plant, or mammalian celts are used as host cells, viruses 
may also be used as vectors. DNA expression vectors may be obtained from natural sources or may comprise 
synthetic DNA. The plasmid chosen for a particular expression system should be compatible with that host, to 
so ensure vector replication and polypeptide expression. The plasmid chosen for incorporation of the genes coding 
for the hybrid polypeptide should possess an origin of replication recognized by the host cell. 
%■ The DNA expression vector should contain DNA sequences recognized by restriction endohuclease en- 

' v zymes to cleave the vector for subsequent ligation with the gene for the hybrid polypeptide wittiout inactivating 
the origin of replication or functions necessary for plasmid selection following transformation, for example within 
55 an antibiotic resistance gene. The vector should contain restriction enzyme cleavage sites that provide suitable 
termini for joining and ligation of foreign genes to be inserted. 

Preferably, the DNA vector contains a single site or two unique sites for incorporation of the hybrid poly- 
peptide gene, neither of which occurs within that gene. 
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It is to be understood that for different chimeric genes, obtaining the correct orientation of the polypeptide 
or polypeptides of interest relative to each other and to the polypeptide or polypeptides for attachment within 
the expression vector employ the basic steps as outlined above. 

Preferably, the gene for each polypeptide of interest is attached in the proper reading frame to the adjacent 
5 gene, and ail adjacent termini are prepared in a manner so as to make them complementary to facilitate ligation. 

Furthermore, genes for any polypeptides of interest requiring cleavage from another polypeptide of interest 
or from one or more polypeptides for attachment must be so constructed as to allow for the proper positioning 
of ail cleavage sites so that their insertion does not result in any genes for any of the polypeptides being in an 
improper reading frame. 

10 The ligation reaction, which covalently joins fragments of DNA, is described in Sambrook, Fritsch, and Man- 

iatis (ibid), and Ausubel et al. (ibid) and is well known to those skilled in the art. 

The ligated plasmid is ready for transformation of host cells. The preferred host is E. coli, however, other 
bacteria, insect cells, yeast, or mammalian or plant cells may be used with a DNA expression vector appropriate 
to that particular host ceil. 

15 Transformation of E. coli is a standard procedure well known to those skilled in the art, wherein a suitable 

host strain, such as E. coli HB101 accepts, harbors, replicates and expresses the plasmid carrying the gene 
for the hybrid polypeptide. Transformation of E. coli is described by Sambrook, et al. If the host is an insect 
cell, transfection may be accomplished by a procedure such as that described by M. Summers and G. Smith, 
A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures. Texas Agricultural Experi- 

20 ment Station Bulletin No. 155, 1988. 

In order to identify the host cells which are transformed, the culture is placed in selective media containing 
an appropriate antibiotic. Only those cells with plasmid-borne resistance will survive. 

Plasmid can be recovered after lysis of surviving cell colonies, and characterized by restriction enzyme 
digestion and mapping, DNA sequencing, or other methods known in the art. Additionally, those colonies which 

25 express hybrid polypeptide can be identified by immunological assay, such as ELISA or Western blotting. In 
some embodiments it may be possible to assay directly for biological activity of the polypeptide or polypeptides 
of interest 

Once transformed cells carrying the hybrid polypeptide are identified, they may be multiplied by established 
techniques, such as fermentation. In addition, the recovered plasmids can be used to transform other strains 
30 of bacteria, or appropriate hosts cells for large-scale production and expression of the hybrid polypeptide. 

The hybrid polypeptide which contains the polypeptide for attachment, the biotin group for binding to avidin, 
the polypeptide of interest, and the optional cleavage site, expressed by the transformed host cells may be 
separated from the medium and other debris by affinity chromatography. 

The preferred affinity medium is the avidin monomer resin described in EP-A-0414785. To this end, host 
35 cells are separated from the medium and broken open for example, by sonication. 

Optionally, hybrid polypeptides can be excreted into the culture media if a signal peptide for extracellular 
secretion is included at the appropriate terminus of the hybrid polypeptide. 

Should such a secreted polypeptide be desired, it may be necessary to include a DNA sequence coding 
for a polypeptide directing extracellular secretion within the chimeric gene coding for the hybrid polypeptide. 
40 The hybrid polypeptide once released is maintained in an appropriate buffer, preferably one in which it is 
soluble. The buffer solution should be formulated to maximize hybrid polypeptide recovery from host cells. Buf- 
fer properties which may be optimized to favor recovery include but are not limited to, pH, ionic composition, 
ionic strength, or presence or absence of various detergent compositions. 
; ' Optionally some fractionation of the host cell extract may be performed in order to concentrate or partially 
45 ' purify the hybrid polypeptide prior to affinity chromatography. 

One preferred method is ammonium sulfate fractionation. It is to be understood that other methods com- 
monly employed in protein purification may also be used prior to affinity chromatography. The cell extract is 
passed over the preferred column for affinity chromatography, the avidin monomer column, which is then wash- 
ed extensively with buffer to remove all unbound materials. 
50"'***; . The hybrid polypeptide is specifically eluted from the column, preferably with acetic acid or biotin. As as 
result, a high yield of highly purified hybrid polypeptide containing the polypeptide of interest is obtained. 

It may be desirable or necessary to cleave the one or more polypeptide of interest from the one or more 
polypeptide for attachment to restore biological activity to the polypeptide of interest 

Separation from the polypeptide for attachment may be accomplished by first suspending the hybrid poly- 
55 peptide in buffer. Thereafter the chemical or proteolytic cleavage agent specific to the. linking amino acid or 
amino acids is added to the suspension and the polypeptide of interest is cleaved. 

For example, if the polypeptide of interest is linked to the polypeptide for attachment by an asp-pro linkage, 
a volatile acid such as formic acid may be added to the suspension to effect cleavage. 
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The following examples are carried out using one or more of the general procedures set forth below. 

In all of the following examples, restriction endonucleases, ligases. polymerases, and other DNA modifying 
enzymes described in specific experimental steps are used according to the recommendations of the manu- 
facturer of the particular enzyme or reagent used. 
5 Two laboratory manuals, Current Protocols in Molecular Biology (Ausubel et al 1989)), and Molecular Clon- 

ing A Laboratory Manual 2nd Edition (Sambrook et al. Cold Spring Harbor Press, Cold Spring Harbor, NY 
(1989)) referenced below, contain supplemental information that may also be useful to one skilled in the art in 
conducting the examples described below. 

10 Procedure I. Restriction Endonuclease Digestion of DNA 

Pvestriction endonuclease digestions using one or more restriction enzymes to digest DNA are generally 
carried out using the protocols set forth in Ausubel et al. Current Protocols in Molecular Biology Volume 1 f Chap- 
ter 3, Unit 3.1. 

is Restriction mapping of plasmids is generally carried out using the protocols set forth in Ausubel et al (ibid), 

Unit 3.2. Restriction enzymes are obtained from Promega (Madison Wl) or New England Biolabs (Beverly MA) 
and complete or partial digestion of DNA with specific enzymes are performed generally according to the man- 
ufacturer's recommendations . 

20 Procedure II. Purification of DNA Fragments Using Agarose Gel Electorphoresis. 

Agarose gel electrophoresis is generally carried out using the protocols set forth in Ausubel et al (ibid). 
Volume I, Chapter 2, Unit 2.5A. Separation and isolation of larger (>1 kb) DNA fragments from excised gel frag- 
ments is generally carried out as set forth by Ausubel et al (ibid), Chapter 2, Unit 2.6, and separation and iso- 
25 lation of smaller (<1 kb) DNA fragments is carried out as set forth in Ausubel et al (ibid) Chapter 2, Unit 2.7. 
Removal of salts and gel fragments from larger DNA fragments is accomplished using the GeneClean kit 
(Bio101 , Inc. San Diego, CA) using procedures supplied by the manufacturer, and removal of salts and gel frag- 
ments from smaller DNA fragments is accomplished using the MerMaid kit (Bio101, Inc. San Diego, CA) also 
using the procedures recommended by the manufacturer. 

30 

Procedure III. Ligation of DNA Fragments. 

Ligation of DNA fragments using T4 DNA Ligase (New England Biolabs Beverly MA) is generally carried 
out as in Ausubel et al (ibid), Unit 3.14, using conditions recommended by the manufacturer. 

35 

Procedure IV. Preparation of Competent E. coli Cells and Transformation of E. coli. 

Preparation of competent E. coli CSH26 cells using calcium chloride and transformation with DNA expres- 
sion vectors are carried out using the protocols described by Sambrook et al Molecular Cloning A Laboratory 
40 Manual 2nd Edition 1989, Chapter 1. 

Clones carrying plasmids containing DNA insertions are identified by growing cells on L-agar supplemented 
with 1 00 mg/L ampicfllin. 

Procedure V. Isolation of Plasmid DNA. 

45 

Plasmid DNA is generally usubel et al (ibid) Chapter 1, Unit 1.7. 
Procedure VI. Preparation of Double-stranded DNA from Synthetic Single-stranded Oligonucleotides. 

so Oligonucleotides are synthesized using standard phosporamidite chemistry (0.2 micromole synthesis). 

Fragments are separated on a 20% polyacrylamide gel under denaturing conditions as described by Ausubel 

et al (ibid) Volume 1, Unit 2. 12 and eluted and desalted as described therein. 

Double-stranded DNA is assembled from upper strand and lower strand pairs of synthetic oligonucleotides 

with overlapping regions of perfect complimentarity of 15 nucleotides at their 3' ends by heating a mixture of 
55 1 ug of each of the strands at 90°C for 5 minutes, followed by slow cooling to room temperature over a period 

of one to two hours. This short duplex region serves as template and primer for mutually primed synthesis of 

a complete double DNA strand with Sequenase, a T7 DNA polymerase obtained from US Biochemical, using 

protocols supplied by the manufacturer. 
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inP^r^^ 

Procedure VII. Preparation of Crude E. coli Cell Extract 

E. coli host cells harboring the plasmid carrying the qene for the hvhrid nnK/nontM- ~~ 

Cells are collected by centrifugation in a GSA rotor at 5.000 X G for 30 minutes at 4»C in a S«~»n pr » 

The pellet formed by the addition of 60% ammonium sulfate is resusoended in 1 on m w ,w, - 
Procedure VIII. Avidin Monomer Affinity Chromatography. 

no ^foVaTKl H^cttL° 3 ^ X ! ^ °' Umn With avWin monomer affini * ^sin (US seria. 

no ^i4. raoj on a LKB HPLC system equipped with two Model 21 50 pumps Model 21 52 controller a „w , mI. 

0. 1 ml /mm to a column equilibrated with the same buffer ' 0f 

Procedure IX. E.utlon of the Hybrid Polypeptide from the Avidin Monomer Affinity Resin 

A. Elution using acetic acid 

s^^SL^jsrisrr " M * — - - 

B. Elution using biotin 
A. Add cleavage 

[Reference: London. M. (1977) Methods in Enzymology 47:145-149 ] 

^T™^^Z^Zm to T /o forrnic add (v/v) and incubated at 40 ° C for 24 »«> ^ 

e rn.xn.re e then freeze-dned. H.ghly pure polypeptide of interest is obtained by Procedure VIII. 
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B. Cyanogen bromide cleavage of methionine residues [Reference: Gross, E. and B. Witkop (1961) journal 
of American Chemical Society 83:1510-1511.] 

The hybrid polypeptide is dissolved in 70% (v/v) aqueous formic acid at 23°C. A 50 molar excess of cya- 
5 nogen bromide is added in a small volume of 70% formic acid, with stirring. 

The mixture is incubated in the dark under nitrogen at 2G-25°C for 16 to 24 hours. The mixture is then diluted 
with 1 0 volumes of water and freeze-dried. 

Highly pure polypeptide of interest is obtained by Procedure VIII. 

10 Procedure XL Separation of the Polypeptide of Interest from the Polypeptide for Attachment 

The dried polypeptide mixture is resuspended in avidin monomer column loading buffer, 100mM phosphate 
buffer pH 6.8-72, or other appropriate buffer at a pH of between 4.0 and 1 1.0. Highly pure polypeptide of interest 
is obtained by passing the cleaved polypeptide mature over the avidin resin using the procedure described 
15 above. 

The polypeptide for attachment is retained by the avidin monomer and the polypeptide of interest is not 
retained. The polypeptide of interest is collected in the column flowthrough. 

Procedure XII. Plasmid Expression Vectors. 

20 

Two plasmids are obtained from D. Samols, Case Western Reserve University. 

A. Plasmid ptac 1.3L This plasmid contains the DNA sequence coding for the 123 amino acid sequence 
of the 1.3S polypeptide of transcarboxylase from Propionibacterium shermanii (SEQ ID NO:2). The DNA 
coding for the 1.3S polypeptide is cloned as a 431 base pair fragment into the polylinker region of the ex- 

25 pression vector pKK223-3 as described by Murtif et al. (Proc Nat Acad Sci USA 82:5617-5621 (1985)). 

B. Plasmid ptad. 3(1-125). The plasmid ptac 1.3(1-125) is described by Murtif and Samols, J Biol Chem 
262:11813-11815 (1987). Like ptac1.3t, ptac1.3(1-125) also contains the 1.3S polypeptide but in addition 
has the sequence: 

M GAT CCA TAA CGC CTA AGC TT (SEQ ID N03) 

at the 3' end of the 1.3S gene that encodes a BamHI restriction endonuclease site. This DNA additional se- 
quence codes for the linking amino acid sequence asp-pro at the carboxyl terminus of the 1.3S polypeptide. 
In order to illustrate the nature of this invention and the manner of practicing the same, the following ex- 
35 amples are presented. 

Example 1. Modification of ptad .3(1-125) to increase hybrid polypeptide expression levels in E. coll 

In order to increase the expression level of hybrid polypeptides produced from chimeric genes inserted into 
AO ptad .3(1-125) from approximately 0. 1 % of total soluble cellular protein to approximately 5.0% of total soluble 
cell protein, ptac^3(1-125) was modified as follows, ptad .3(1-1 25) was digested with the restriction enzymes 
Xhol and Hindlll. The desired 131 base pair (bp) fragment was obtained by agarose gel electrophoresis. 

The vector ptad. 3t was also digested with Xhol and Htndlll using the conditions described above, and the 
4.86 kilobase (kb) fragment was obtained by agarose gel purification. The plasmid ptac1.3dp (Figure 1) was 
45 obtained by ligation of the 131 bp fragment from ptad .3(1-125) to the 4.86 kb fragment of ptad.3t 

The Hgated plasmid mixture was used to transform competent E. coli HB101. An E. colt clone harboring 
ptad .3dp was identified by restriction enzyme digestion of plasmids isolated from selected ampicillin-resistant 
E. coli cells. 

so Example 2. Fusion of a polypeptide of interest to the C-terminus of a polypeptide for attachment with an 
acid cleavage site between the polypeptides 

This example describes a hybrid polypeptide in which a synthetic b-endorphin polypeptide is fused to the 
carboxyl terminus of the 1 . 3S polypeptide. An asp-pro cleavage site is incorporated between the two polypep- 
55 tides for cleavage and subsequent purification of b-endorphin away from the 1.3S polypeptide after affinity pur- 
ification using avidin monomer resin. 

The amino acid sequence of a modified b-endorphin polypeptide is shown in SEQ ID NO:6 and the corre- 
sponding nucleotide sequence coding for this amino acid sequence is shown in SEQ ID NO:7. 



15 



EP0 511 747 A1 



/ DU T u he Synlhetic oligonucleotides from which this synthetic gene was assembled are shown in SEO in una 

ZXIT ,D N ° :9 (RHCb62) - Th6re ~ "° inl ~ -idues XS^SSSi 

at thI h 5- B enT o! SSSSJI t T reC09n " i0n SC<,UenCe 6GATCC at nudeo ^ Positions 12 through 1 7 
at the 5 end of SEQ ID NO:6 allows the introducion of an ATG codon at the 5' terminus of the b-endomhin 
gene when this fragment k introduced into the BamHI site of ptac1.3dp (SEQ ID h£X££ZZ?1Z 
th.on.ne at the N-terminus of the polypeptide. ' V 9 3 me ~ 

To maximize expression in E. coli. the amino acid sequence of authentic b-endorphin was reverse trans- 
lated mto a DNA sequence using preferred codon usages of highly expressed E S (D^er K fcteZ 
8. Maximizing Gene Expression. W. Reznikoff and L. Gold Eds) 9 enss (DeBoer. H Chapter 

cedu^VM 61 RHCb62 WCre SymheSi2ed - annea ' ed 3nd fi " ed by 17 DNA to^'™* « Scribed in Prc- 

bo flllT^ d ° f Ub, r tranded ° NA SCC,uence < SEQ 10 NO:5 ) was digested with BamHI . generating 105 
bp fragment codmg for the synthetic b-endorphin which was purified by agarose gel electropno^s? 

Pure l.near plasmid was recovered by agarose gel electrophoresis, and the 1 1 0 bp synthetic b-Anrfomhin 
fSfSES ^ ' i9ated 10 1,16 PUC19 P,aSmid ' a " d th ' S *~ P ' asmid - -dforn^Cett 

Following plasmid isolation from ampicillin-resistant clones, the recombinant E coli cells harborino th* ™ 
^yn^ 

me synthetic b-endorphin was designated pUC1 9endorB3. 
Cloning of synthetic b-endorphin into ptad.3dp. 

The b endorphin gene in pUC19endorB3 was fused to the 3' terminus of the 1 n*no ,•« ^ ^ *a 

Cte'n^ 31 1,16 N - terminUS 0f *> deaved endorphin polypeptide 9 
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Example 3. Fusion of a polypeptide of interest to a polypeptide for attachment, with a methionine cleavage 
site between the two polypeptides. 

In this example, the synthetic b-endorphin polypeptide is fused at its N-terminus to a methionine residue, 
5 this methionine residue being positioned at the C-terminus of the 1.3S polypeptide, thus providing a single ami- 
no acid cleavage site for separation of b-endorphin from the 1.3S polypeptide following avidin monomer chro- 
matography. 

Cleavage with cyanogen bromide yields an unmodified N-terminus on b-endorphin, as the methionine is 
cleaved from the N-terminus of b-endorphin. 

10 The vector ptac1.3dp is digested to completion with Xhol and Hindlll. and the 131 bp fragment is purified 

by agarose gel electrophoresis. This 131 bp fragment is subjected to partial digestion with Sau3A, and the 110 
bp fragment so generated is isolated and purified by agarose gel electrophoresis. This Xhot-Sau3A fragment 
is ligated to a double-stranded synthetic DNA fragment (SEQ ID NO:12) coding for b-endorphin. This b-endor- 
phin has a methionine residue at its N-terminus. This fragment in SEQ ID NO:12 is assembled from synthetic 

is oligonucleotides SEQ ID NO:10 and SEQ ID NO:1 1 as described in Procedure VI, and is digested to completion 
with Sau3A prior to ligation to the 110 bp Xhol-Sau3A fragment. The 220 bp product of this ligation is purified 
by agarose gel electrophoresis. 

The vector ptac1.3dp is subjected to partial digestion with Xhol and BamHI , and the 4886 bp linear vector 
is purified by agarose gel electrophoresis. The ligated plasmid is used to transform competent E. coli CSH26, 

20 prepared according to Procedure IV. Plasmids are isolated from transformed ampiciilin-resistant E. coli clones, 
and a plasmid containing the desired gene in the correct orientation is identified by restriction enzyme analysis 
and designated ptad .3dp:metendor (Figure 2C). 

A highly pure preparation of the synthetic b-endorphin is obtained by inoculation of L-broth containing 100 
mg/l ampicillin with the E. coli host harboring ptad. 3:metendor. A crude protein extract containing the 1.3S:b- 

25 endorphin hybrid polypeptide is obtained by following Procedure VII; 

Highly pure 1.3S:b-endorphin polypeptide is obtained by avidin monomer affinity chromatography descri- 
bed in Procedure VI ll, using elution with acetic acid to elute the purified hybrid polypeptide from the resin (pro- 
cedure VIHA). Cleavage of b-endorphin from the 1 .3S polypeptide was accomplished by incubation in cyanogen 
bromide according to Part B of Procedure X, and highly pure b-endorphin is obtained by avidin monomer affinity 

30 chromatography of the cleavage mixture by repeating Procedure XI. 

Example 4. Fusion of a polypeptide of interest directly to the carboxyl terminus of a polypeptide for attach- 
ment with no linking amino acid sequence being present in the hybrid polypeptide. 

35 In this example, a synthetic b-endorphin polypeptide is fused directly to the C-terminus of the 1.3S poly- 
peptide. Avidin monomer chromatography is used to obtain highly pure b-endorphin in the form of a hybrid fu- 
sion polypeptide. 

The construct ptac1.3t (Procedure XII) is digested to completion with Xhol and Hindlll, and the smaller 131. 
bp fragment is purified by agarose gel electrophoresis. This 131 bp fragment is subjected to partial digestion 
40 with Sau3A I, and the 110 bp fragment is agarose-gel purified. The 110 bp fragment is ligated to the double 
stranded DNA fragment SEQ ID NO:15. 

SEQ ID NO:1 5 encodes a synthetic b-endorphin gene with no DNA coding for a linking amino acid or amino 
acid sequence at its 3' terminus. 

SEQ ID NO:15$s assembled from synthetic oligonucleotides SEQ ID NO:13 and SEQ ID NO:14 using Pro- 
45 cedureVI. 

Prior to ligation to the 110 bp fragment SEQ ID NO:15 is digested to completion with Sau3A I. The 217. 
bp ligation product is purified by agarose gel electrophoresis. 

Vector ptad .3dp is linearized by partial digestion with Xhol and BamHI , and the 4886 bp fragment is also 
purified using agarose gel electorphoresis. 
so The ligated plasmid is used to transform competent E. coli CSH26. 

Recombinant plasmids are isolated from ampiciilin-resistant transformants, and a clone containing the de- 
sired gene in the correct orientation is identified by restriction enzyme analysis and designated ptac1.3:endor 
(Figure 2D). 

A highly pure preparation of the synthetic b-endorphin is obtained by inoculation of L-broth containing 100 
55 mg/l ampicillin with the E. coli host harboring ptac1.3:endor. 

A crude protein extract containing the the 1 .3S:b-endorphin hybrid polypeptide is obtained by following Pro- 
cedure VII. 

Highly pure 1.3S:b-endorphin polypeptide is obtained by avidin monomer affinity chromatography as de- 
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In this example, two b-endorphin polypeptides in tandem are ft.caH f rt »h 0 r» 
Me. with the linking amino acids Lp-£>-^ J C term.nus of one 1.3S polypep- 

of the 1.3S polypep.de and f ™ *• 

gel electrophoresis ° ' D N0:5Th,s fra 9™nt is purified by agarose 

assembled from SEQ ID NO:19 and SEQ ID NO 20 * " d SEQ ,D N0:21 ' 

SEQ ID NO:1S and SEQ ID NO:21 are digetM wilt, BamHI and Boated 

r ~ * - 

ment is ligated to the 1 1 8 bo Xhol-BamHl 1 S 1« ? ' S *. d by agarose 9 el electrophoresis, and this frag- 
product is' Puri fied by agarose g I 225=^! P- 81 - 3 *- ™ S ^ 
digestion of p,ac1.3dp linear^ by ^ZS££3£2£? * * >— 

Plasmid DNA is isolated from transformed E coli HRmi Piae™;,^ ~ * • • ef 

am™,, dh™™^ hISSSSS- !T *T! * W » K ™' 

Irem Ore resin (predddureVIIIA). ™" vl "' us,n 9 araMa ^ "d""" "laportled hyMdpdlnrepSdd 



td Old C-renWreid of ihe adore 1 m »»tZjSf J ? P°Dwdd. «d S»n0retld b-dndoaphln w» toed 
ndnddadlgudd, p^SS^ SST^ "* ' WyP "" k ' e ' :0nsls,l "' rf '*> 

dorB3. containing 100 mg/J amp,cfllin with the E. coli host harboring ptacrmalB: 1.3:en- 
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A crude protein extract containing the hybrid polypeptide is obtained by following Procedure VII. 

Highly pure maltose binding protein:1.3S:b- endorphin hybrid polypeptide is obtained by avidin monomer 
affinity chromatography described in Procedure VIII, using biotin to elute the purified hybrid polypeptide from 
the resin (Procedure VIIIB). 

5 Biotin is removed from the polypeptide suspension by dialysis against three changes of 1 00 mM ammonium 

carbonate buffer, pH 7.2, followed by freeze-drying of the sample. 

Cleavage of b-endorphin from the 1.3S polypeptide is accomplished by incubation in cyanogen bromide 
according to Part B of Procedure X, and highly pure b endorphin is obtained by avidin monomer affinity chro- 
matography of the cleavage mixture by repeating Procedure XI. 

10 The maltose-binding protein-1.3S hybrid polypeptide was recovered in highly pure form by repeating the 

biotin elution procedure described in Procedure VIIIB. 

Example 7. Fusion of two polypeptides of interest in tandem to the N-terminus of a polypeptide for attach- 
ment with an amino acid cleavage sequence separating the first polypeptide of interest from the polypep- 
15 tide for attachment and a second amino acid cleavage sequence separating the first polypeptide of interest 
from the second polypeptide of interest 

The maltose binding protein and b-endorphin are fused in tandem to the amino-terminus of the 1.3S polyr 
peptide. An amino acid cleavage site separates the maltose binding protein and b-endorphin, and another ami- 

20 no acid cleavage site separates b-endorphin and the 1.3S polypeptide. 

The plasmid ptac1.3dp is digested with Hindi and Hindlll. The 332 bp fragment is purified by agarose gel 
electrophoresis. Linkers encoding a BamHI recognition sequence (CGGATCCG) are ligated to this fragment, 
and the fragment is digested with BamHI to generate a BamHI site at the 5' terminus of the fragment The 338 
bp BamHI -Hindlll fragment so generated is purified by agarose gel electrophoresis . 

25 The DNA fragment SEQ ID NO:15 is digested with BamHI , and is ligated to the 338 bp BamHI-Hindlll modi- 

fied fragment from ptad .3dp. The desired 444 bp fragment is purified by agarose gel electrophoresis. The vec- 
tor pMAL-c (New England Biolabs, Beverly, MA) is digested to a 6.1 kb fragment with Hindlll and BamHI , and 
the large fragment is purified by agarose gel electrophoresis. The 444 bp fragment and the 6.1 kb fragment 
are ligated, and the ligation mix is used to transform competent E. coli CSH26. 

30 Plasmid DNA is isolated, and plasmids containings the chimeric gene in the correct orientation are con- 

firmed by restriction endonuclease mapping, and are designated ptac:ma!C:endorB3: 1 .3dp (Fig. 3C). The cor- 
rect recombinant plasmid codes for a fusion protein composed of a 42,000 M W maltose binding protein fused 
by an asp-pro-met linker to b-endorphin joined by an asp-pro linker to amino acids 19-123 of the 1.3S poly- 
peptide having an asp-pro carboxyi terminus. 

35 Highly pure maltose binding proteimb- endorphin: 1.3S hybrid polypeptide is obtained by avidin monomer 

affinity chromatography described in Procedure VIII, using biotin to elute the purified hybrid polypeptide from 
the resin (Procedure VIIIB). Biotin is removed from the polypeptide suspension by dialysis against three 
changes of 100 mM ammonium carbonate buffer, pH 7.2. followed by freeze-drying of the sample. 

Cleavage of b-endorphin from the 1 .3S polypeptide is accomplished by incubation in formic acid according 

40 to Part A of Procedure X, and highly pure b-endorphin is obtained by avidin monomer affinity chromatography 
of the cleavage mixture by repeating Procedure XI. The maltose binding protein is recovered in highly pure form 
by elution of the maltose binding protein:1 .3S hybrid polypeptide from the avidin monomer resin with biotin using 
Part B of Procedure IX 

The hybrid polypeptide is purified away from the biotin by dialysis against three changes of 100 mM anv 
45 monium carbonate buffer, pH 7.2, followed by freeze-drying of the sample. The sample is freeze-dried, and 
reconstituted in cyanogen bromide according to Part B of Procedure X. The maltose-binding protein is recov- 
ered in highly pure form by repeating the avidin monomer chromatography process detailed in Procedure XI. 

Example 8. Fusion of two polypeptides of interest to the C-termini of two polypeptides for attachment within 
so t he same hybrid polypeptide. 

In this example, two noncontiguous b-endorphin polypeptides are fused to two noncontiguous 1.3S poly- 
peptides within one hybrid polypeptide with a cleavage amino acid sequence between each 1.3S polypeptide 
and the b-endorphin to which it is directly linked, producing the fusion hybrid polypeptide 1.3S:asp-pro-met b- 
55 endorphin:asp-pro:1.3S:asp-pro-metb-endorphin. 

The vector 1.3dp:endorB3 (Figure 2A) is digested with Hindi and Hindlll and the 437 bp fragment is purified 
by agarose gel electrophoresis. Synthetic DNA linkers encoding a BamHI recognition sequence, CGGATCCG 
(New England Biolabs, Inc. Beverly, MA) are ligated to this 437 bp fragment 
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443 ssiShiTss a dn i - s ,o 8amHi di9esti ° n to senerate BamHi c ° hesive ,ermini - ^ 

443 bp BamHI-H.ndlll fragment is purified by agarose gel electrophoresis. The ptac 1.3endorendor fFioure 
3A) ,s partially digested with BamH. and Hind.ll to yield a linear DNA of 5.1 kb digested STSSmE 

tofhe h 3 r 9 ' e I" 51 kt> fr39ment iS PUdfied * «»~ 9 el *ec£^ 
to the 443 bp fragment The ligation mix is used to transform competent E. coli HB101 

Plasmid DNA is isolated from ampicillin-resistant transformed E coli anH ri« m wc — . • ■ ... 

chimeric gene orientation are confirmed by reason endorse mapping ** 

The recombinant plasmid obtained by this procedure is desianated otad ^ n H«r -» /c- 

endorph,n.1.3S.b-endorphin hybnd polypeptide is obtained by following Procedure VII 
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SEQUENCE LISTING 
SEP. ID NO:l 

SEQUENCE TYPE: Peptide 

SEQUENCE LENGTH: 123 

MOLECULE TYPE: Protein 

ORIGINAL SOURCE ORGANISM: Bacterium 

SOURCE NAME: Propionibacterium shermanii 

FEATURES: From 58 to 100 - biotin-binding recognition sequence 

PROPERTIES: 1.3S biotin-binding protein 



Met Lys Leu Lys Val Thr Val Asn Gly Thr Ala Tyr Asp Val Asp Val 
15 10 15 

Asp Val Asp Lys Ser His Glu Asn Pro Met Gly Thr lie Leu Phe Gly 

20 25 30 

Gly Gly Thr Gly Gly Ala Pro Ala Pro Arg Ala Ala Gly Gly Ala Gly 

35 40 45 

Ala Gly Lys Ala Gly Glu Gly Glu He Pro Ala Pro Leu Ala Gly Thr 

50 55 60 

Val Ser Lys He Leu Val Lys Glu Gly Asp Thr Val Lys Ala Gly Gin 
65 70 75 80 

Thr Val Leu Val Leu Glu Ala Met Lys Met Glx Thr Glu lie Asn Ala 

85 90 95 

Pro Thr Asp Gly Lys Val Glu Lys Val Leu Val Lys Glu Arg Asp Ala 

100 105 110 

Val Gin Gly Gly Gin Gly Leu He Lys lie Gly 
115 120 
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SEQUENCE TYPE: Peptide 

SEQUENCE LENGTH: 43 

MOLECULE TYPE Peptide 

ORIGINAL SOURCE ORGANISM: Bacterium 

SOURCE NAME: Propionibacterium shermanii 

PROPERTIES: Biotin-binding recognition sequence 

Pro Ala Pro Uu Ala Gly Thr VaJ Ser Lys De Uu Val Lys Glu Gly 

15 10 15 

Asp Thr Val Lys Ala Gly Gin Thr Val Uu Val Uu Glu Ala Met Lys 



20 25 



30 



Met Glx Thr Glu De Asn Ala Pro Thr Asp Gly 
35 40 

SEP ID MO 7 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 20 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 

PROPERTIES: Termination fragment for a BamHI cleavage site 
GATCCATAAC GCCTAAGCTT 
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20 



25 
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SEP ID NO:4 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 372 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
ORIGINAL SOURCE ORGANISM: Bacterium 
SOURCE NAME: Propionibacterium shermanii 
PROPERTIES: Gene coding for 1.3S polypeptide 



ATGAAACTGA AGGTAACAGT CAACGGCACT GCGTATGACG 40 

TTGACGTTGA CGTCGACAAG TCACACGAAA ACCCGATGGG 80 

CACCATCCTG TTCGGCGGCG GCACCGGCGG CGCGCCGGCA 120 

CCGCGCGCAG CAGGTGGCGC AGGCGCCGGT AAGGCCGGAG 160 

AGGGCGAGAT TCCCGCTCCG CTGGCCGGCA CCGTCTCCAA 200 

GATCCTCGTG AAGGAGGGTG ACACGGTCAA GGCTGGTCAG 240 

ACCGTGCTCG TTCTCGAGGC CATGAAGATG GAGACCGAGA 280 

45 TCAACGCTCC CACCG ACGGC AAGGTCG AG A AGGTCCTTGT 320 

CAAGGAGCGT GACGCCGTGC AGGGCGGTCA GGGTCTCATC 360 

AAGATCGGCT GA 372 
so ' 



55 



23 



10 



15 



25 



30 



35 



40 



45 



SO 



EP 0 511 747 A1 

SEP ID MP S 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 390 

STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Oligonucleotide ptacl.3dp 



AGGTAACAGT ^AACGGCACT GCGTATGACG 
TOACGTTGA CGTCGACAAG TCACACGAAA ACCCG ATGGG 

^^CTGTrCC^COXGarACCaxrGGCGCGGCGG^A 
CCGCGCGCAG CAGGTGGCGC AGGCGCCGGT AAGGCCGGAG 

agggcgagattcccgctccgciggccggcaccgtctccaI 

GATCCTCGTG AAGGAGGGTG ACACGGTCAA GGCTGGTCAG 

"""^ CATGAAGATG --CCG^A 
TCAACGCTCC CACCGACGGC AAGGTCGAGA AGGTCCTTGT 

caaggacxgtgacgccgtgcagggcggtcagggtctSt^ 

AAGATCGGCT GATCCATAAC GCCTAAGCTT 
SEOID1MO-A 

SEQUENCE TYPE: Peptide 

SEQUENCE LENGTH: 31 

MOLECULE TYPE: Peptide 

PROPERTIES: Modified b-endorphin polypeptide 

TyrGlyGlyPheUuThrSexauLysSerGInrn.ProUuValThr 

5 10 15 

Leu Phe Lys Asa Ala He He Lys Asn Ala Tyr Lys Lys Gly GIu 
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SEOIDNO:7 



SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 135 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 

PROPERTIES: Gene coding for modified b-endorphin polypeptide 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 40 

20 CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 80 

CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 120 
TCCGAATTCG AGCTC 



SEP ID NO: 8 

30 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
35 STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Oligonucleotide RHcbel 



4S 



SO 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 40 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTC 75 
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SEP ID NO- Q 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 

STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Oligonucleotide RHcbe2 



GAGCTCGAAT TCGGATCCTT ATTCGCCTTT TTTGTATGCG 
TTTTTGATGA TAGCGTTTTT GAACAGAGTA ACCAG 



SEP ID NO lQ 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 

STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 



AAGCTTCTAG AGATCGCCAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTC 
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SE P ID NO-.ll 

s SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 75 

STRANDEDNESS: Double-stranded 
10 TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 

PROPERTIES: Synthetic oligonucleotide 

15 

GAGCTCGAAT TCGGATCCTT ATTCGCCTTT TTTGTATGCG 40 
20 IT 11 1 GATGA TAGCGTTTTT GAACAGAGTA ACCAG 75 



25 SEP ID NO:12 

SEQUENCE TYPE: Nucleotide 
*> SEQUENCE LENGTH: 135 

STRANDEDNESS: Double-stranded 

TOPOLOGY: Linear 
35 MOLECULE TYPE: Genomic DNA 

40 AAGCTTCTAG AGATCGGCAT GTACGGTGGT TTCCTGACCT 40 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 80 
CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 120 

45 TCCGAATTCG AGCTC 135 

so 



55 



27 



SEP DM Mn n 



EP 0 511 747 A1 



SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 75 

STRAN DEDNESS: Double-stranded 

TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 

PROPERTIES: Synthetic oligonucleotide 



AAGCTTCTAG AGATCGGCTA CGGTGGTTTC CTGACCTCCG 
AAAAATCrCA GACCCCGCTG GTTACTCTGT TCAAA 



SEP ID MO- 1 A 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 72 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 



GAGCTCGAAT TCGGATCCTT ATTCGCCTTT TTTGTATGCG 
TTTTTGATGA TAGCGTTTTT GAACAGAGTA AC 
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SEOIDNO:15 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 132 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 



AAGCTTCTAG AGATCGGCTA CGGTGGTTTC CTGACCTCCG 40 

AAAAATCTCA GACCCCGCTG GTTACTCTGT TCAAAAACGC 80 

TATCATCAAA AACGCATACA AAAAAGGCGA ATAAGGATCC 120 

GAATTCGAGC TC 132 



SEP ID NO:1 6 

30 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
m PROPERTIES: Synthetic oligonucleotide 

45 AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 40 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTC 75 
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SEP ID MO- 17 



SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 71 

STRANDEDNESS: Double-stranded 
10 TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 

PROPERTIES: Synthetic oligonucleotide 

GAGCTCGAAT TCGGATCCTC GCCTTTTTTG TATGCGTTTT 
^ TGATGATAGC GTTTTTGAAC AGAGTAACCA G 



SEP ID NO-IS 
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SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 130 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 
CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAGGATCCG 120 
AATTCGAGCTC 

130 



40 
80 
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SEOIDNO:19 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTC 



SEP ID NO:20 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

!*>• 

G AGCTCGAAT TCGGATCCTT ATTCGCCTTT TTTGTATGCG 
TTTTTGATGA TAGCGTTTTT GAACAGAGTA ACCAG 
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SEP ID MP ?. 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH- 135 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 
CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 120 
TCCGAATTCG agctc 

13S 



40 
80 



Claims 
1. 



C temlnus of Ihe Wdin-KMing of interesl is fused loth. 

Thr Val Ser Lys lie Leu Val Lys Glu Glv Asd 1, « I* - S6quence Pro ^ Pro >-eu Ala Gly 
Met Glu Thr gTu He Asn Afc £ SpSy ^ °* ^ ^ ^ LeU Val Leu G,u *a £ 

^^^T^f^TsZZT^ 1 10 6 wherein - 

be the same or different ~""9uo U s and/or con bguous avidin-binding polypeptides, which may 

Arecomhinanthyhridpo.ypep.de according - any one of .aims 1 to 6 where, the polypepUde of interest 
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comprises a plurality of non-contiguous and/or contiguous polypeptides of interest, which may be the same 
or different. 

9. A recombinant hybrid polypeptide according to any one of claims 1 to 8 wherein the polypeptide of interest 
is an enzyme, is an antigen useful for vaccine production, or a diagnostic reagent 

10. A recombinant hybrid polypeptide according to any one of claims 1 to 9 wherein the polypeptide of interest 
has antitumor activity or has an amino acid sequence for recognition of antigens. 

11. A nucleic acid sequence coding for a hybrid polypeptide as defined in any one of claims 1 to 10 wherein 
the nucleic acid sequence comprises a nucleic acid sequence coding for an avidin-binding polypeptide 
upstream of a nucleic acid sequence coding for a polypeptide of interest. 

12. A nucleic acid sequence according to claim 1 1 wherein the nucleic acid sequence is a DNA sequence. 

is 13. A nucleic acid sequence according to claim 12 wherein the DNA sequence contains in a 5' to 3' direction 
on the coding strand a gene comprising a 5' promoter region, the DNA sequence coding for the avidin- 
binding polypeptide and the DNA sequence coding for the polypeptide of interest 

14. A nucleic acid sequence according to daim 12 or claim 13 wherein the DNA sequence is, or is part of, an 
20 expression vector or a plasmid. 

15. A process for the production of a hybrid polypeptide as defined in any one of claims 1 to 10 comprising 
constructing a plasmid containing a nucleic acid sequence as defined in any one of claims 1 1 to 1 3, trans- 
forming the plasmid into a procaryotic or eucaryotic host cell expression system, expressing the system, 

25 contacting the hybrid polypeptide resulting from the expression system with avidin, and harvesting the re- 

sulting avidin-bound hybrid polypeptide. 

16. A process according to claim 15 wherein the expression system is either E. Coli or insect cells. 
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17. A process for the isolation of a hybrid polypeptide as defined in any one of claims 1 to 10 comprising con- 
tacting the hybrid polypeptide with avidin. 

18. A process according to any one of claims 1 5 to 1 7 wherein the avidin is monomeric avidin, tetrameric avidin 
or streptavidin. 

35 19. A process according to any one of claims 15 to 18 wherein the polypeptide of interest is cleaved from the 
isolated hybrid polypeptide. 

20. A process according to any one of claims 1 5 to 1 9 wherein the hybrid polypeptide is isolated using avidin 
covalentJy bound to a chemically inert, solid, water and solvent insoluble substrate through a chemically 

40 stable non-hydrolyzable linking group, preferably the hybrid polypeptide is isolated using avidin monomer 

affinity chromatography. 

21. A first kit comprising a hybrid polypeptide as defined in any one of claims 1 to 10 and avidin. 

^ 22. A second kit comprising a nucleic acid sequence as defined in any one of claims 1 1 to 14 and avidin. 

23. A third kit comprising a nucleic acid sequence which codes for an avidin-binding polypeptide containing 
a biotin attachment domain and which is fusable to a nucleic acid sequence coding for a polypeptide of 
interest in order to form a hybrid nucleic acid sequence as defined in any one of daims 11 to 14 and avidin. 

50 24. A kit according to claim 23 wherein the kit comprises means to fuse the nudeic acid sequence coding for 
the avidin-binding polypeptide to the nudeic acid sequence coding for the polypeptide of interest in order 
to form the nudeic acid sequence as defined in any one of daims 1 1 to 14. 

25. A kit according to any one of claims 21 to 24 wherein the kit comprises means to deave the polypeptide 
55 of interest from the avidin-binding polypeptide. 
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Ptac1.3dp. 



Fig.1. 
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